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Paiametdo multl-^hidinel 



The iuveatlon xektes to a method of encoding a muM-chaim&l atldio dgosi^ m 
encoder for encoding amubi-ob^smel audio signai» an apparatus for supplying an audio 
signal, an encoded audio aignaL, a ^tocage medium on wbidi the encoded audio aigaal is 
nbpxesd, a mefhod of decoding an encoded audio signal, a decoder for decodiiig an eroded 
5 audio signal^ and an apparatus for supplying a decoded audio signal. 



EP-Ari 107232 discloses a parametric coding scheme to geneiiate a 
representation of a stereo audio signal "wkdidlx is composed of a left cdiannel signal end a right 

10 channel signal. To effidenHy udlLoe tiansmissioti bandtsrldtb, such a re|n:6Sentati0n contains 
information concerning oidy a monaural signal v^bich is ^ther Hie left channel signal or the 
right channel ^gnaU and parametric infoimation. Ute olher stereo signal can be recovered 
b9^ed on the monaural signal togetiher vHth the parametric infoimaiion. The paramelric 
information comprises locaIi2;ation cues of the stereo audio signal^ inoludbig intensi^ and 

IS phase characteristics of the left: andlJie ri^ channel. 



It; is an object of the invention to provide a parametrlo multi-channel audio 
astern which is able to scale the qt^ality of the encoded audio signal mHi the available bit 

20 rate or to scale the quality of the decoded audio signal with the compleadty of the decoder or 
liie available transmisi^n bandwidOL 

A first aspect of the invention provides a metbod of encoding a multi-channel 
audio signal as claimed in claim 1. A second aspect of the invention provides a method of 
encodi!^ a mnlti-ohannel audio signal aa claimed in claim 2. A third aspect of the invention 

25 provides an encoder for encoding a multirohannel audio signal as clauned iu claim 14. A 
fburfli aspect of the invaotion provides an encode for encoding a multi-channel audio signal 
as claimed in claim 1 5. A fifth aspect of the invention provides an apparatus for supplying an 
audio signal as clafaned in claim 16. A sixth aspect of the invention provides an encoded 
audio signal as claimed to claim 17. A seventh aspect of the invention provides a storage 
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2 16,07.2002 
xnedimu on vMch the encoded signal is stored is claimed in claim 18. An dght aspect of tha 
invention provides a method of decoding as claimed in claim 19. A ninth aspect of the 
Invention provides a decoder for decoding m encoded audio signal as claimed in claim 20. A 
tentti aspect of tbe invention provides an apparatas tot supplying a decoded audio signal as 
claimed in claim 21. Advantageous embodiments are defined itx the dependent claims. 

In the me&od of encoding a multi-channel audio signal in accordance with the 
first aspect of the invendon, a sinj^e channel audio signal is geno^ted. FUrd^er^ is^xmation 
is geiierated ftom the multi-channel audio signal allowing tecovering, with a required quality 



)nJ^emMY,j3&jnfoimaUQfi^ 



ftomEP«A4l07232. 

In accordance with ihe first aspect of the invention^ the hxfbrmation is 
generated by deteimimng a first pordon of flxe information for a first firequeaoy region of the 
multi-cbannel audio signal, and by determining a second pordon of the infbrmation for a 

IS second &equenoy region of the multi-*chamiel audio signal. Ihe second firequency region is a 
portion of the first fiequency region and thus is a sub-range of the first firequency regjion. 
Now, two levels of quality of decoding are possible. For a low quality level of the decoded 
multl-ohannel audio sl@:ial, the decoder uses the encoded smgle diannel audio si^ial, and the 
first portion of the infcnmatlon. For a higher quality level, the decoder use? the encoded 

20 single channel audio signal^ and botb.tbe first and the second portion of the information. Of 
course, it is possible to select the decoding quality out of a multitade of levels if a multitude 
of portions of information each being associated witb a difSistent frequency re^u are 
present For example, the fibrst portion may compdse a single set of parametess determined 
witiun a frequency region which covers the fiall bandwidth of the multt-channel audio signal. 

25 And the second portion may comprise several sets of parameters, earfi set of i«ffameters 
being deteimined fer a sub-range or portion of the ftdl bandwidth. Together, the portions 
preferably covet Ae fiUl bandwidth. But many other possibilities e>dsL For example, the first 
portion may comprise two sets of parameters, the first set being determined fi>r a firequency 
re^on which coveis a lower part of the fhll bandwidth, and the second set being determined 

30 for a frequency region covering the other part of the fiall bandwidfiau The second portion may 
comprise two sets of parameters detannined for two frequency re^ons wiiUn the lower part 
of ihe fWl bandwidth. It is not required tixat the number of sets of parameters &r the low» 
part and the higher part of the full bandwlddi are equal. 
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This sepresentadoft ofihe eoooded attdio signal allows a quality of ihe 
decoded awdio signal to depend on me compleKlty of the deoodet , For example, ia a sii«plc 
portaWe decoder a low oomplexi^ decoder may be used v^Jioh has a low power consuiE«»fioa 
a,^ ^ 13 therefore able to use only part of ihe information. In ahl^ end application, a 
compUK decoder is ijsed vAudx iiaes aU llie information avalable in fl»e coded slgnaL 

The quali^ of ite decoded attdio can also depend on the acvailaW 
. . , .^j-^jxt T*rt.«+«„em<(iB?rtnhandwidi!i is hiehthe decoder oaa decode all 



a^raaablelayera. sinoeli«>y are all transmitted, If the transmission bandwidtii is low ihe 
tiaasmitber caa dedde to only transmit a limited anmber of layers. 

la a second a^eot of flie invention, fte encoder xecdves a maximum 
allowable bit rate of 4e encoded m«Mi-ehannel audio signaL This maximum allowably bit 
rate may be defined by the available bh rate of a transmission djan^ 

a storage medium. In appUoations ^H*fiWin the iiansmisw^ 
the maaUmum aUowable bit rate chaages in time, it is important to be able to 

fluctaationsof ibe transmis^onbandwidlh to pr^ a very low quality of tije decoded audio 
signal. KonnaUy , Ibe encoder encodes all av^able layers. It is decided at ^ transmittiae. 
end what layers to transmit, depending on lhe available channel cap«aiy . It is posjdble 1» do 
twa wifli the encoder inthe loop, butlMs is more complioatedtiiat just stripping some layers 

pcioE to transmis^n. 

The encoder only adds the second portion of the injawmatloa fiw the second 
fteqnency region of the mnlti-channel audio signal to the encoded audio signal if a bit late of 
the encoded multi-channel andio signal v*ich comprises the single chaaud audio signal, and 
tbe frst and second portioa of the information is not hi^ Hian the madmmn allowable bit 
rate. Thus, Uib sectmd portion is not present in the coded audio signal if the transmissioa 
25 bandwidfli is not large enoui^i to si^port flie transmission of the second portion. 

In an embodiment as defined in daim 4, ihe in&maation comprises sets of 



parametesa, each one of the portions ot the miormanon is rei»B»ca«u «jr »^ — 
Th«ft««Aerofaetsofpammeter8dependlngonthennmberoffi«qwncyzegio^ 



present in the portions of ihe iofcnnation. 

In an embodiment as defined in claim 6, &e sets of pat smeters comprise at 



30 

least one of the localization cqes. 

la an embodiment as del 
^.Vcrf.....«4<.ii'<r f^nvATfi th^ fiill bandvndth 



squired 



» 
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audio 3ig!ml iixto the multi-chmmel audio signaU In this way a baslo level of quality of the 
decoded aodio signal is guaranteed. The second frequency range covers pasi; of tlie full 
bandwidth. In this way, the second portion when present in the coded audio signal improves 
the qualtiy of the decoded audio signal ia this £:equanoy range. 

In an eiiibodim^ as defined in daim 8» tbe secoM portioa of i3ie information 
coioprises at least two frequraicy ranges which together substantially cover the fUU bandwidth 
of ih.e multi-<iha»nel audio signal > hx this way, the quali^ improv^nent provided by ihe 
second portion is present over the complete bandwid^.. 

hi m em bo d i ment - as-defme d- in -o lahn - g ^ i Lfei — 



JJ3 .singieLchanneLaudio-signaL^ii^ 

encoded audio signal. The enhancement layer which comprises the second poildon of tiie 
inforEoation is encoded only if the bit rate of the encoded audio signal does not exceed tbe 
maximally allowable bit rate. In this way, the quaUly of the decoded audio signal will dep 
on the msKinially allowable bit rate. If the maximally allowable bit rabe is too low to 

1 5 accommodate the enhanc^aiyent layer» the decoded audio signal wiU be obtained &am the 
base layer which will produce a belter quality of the decoded audio than will be the case if 
unpredictable parts of the coded audio wiU not reach the decoder. 

In the embodim^Q as defined in any one of the claims 10 to 12» the portiona 
of the inforxnadon (usually containing sets of parameters^ one set for each fiequencgr band 

20 represented) in an^t &ame are coded based on the parameters of the prervious ficame. 

Usually, tins seduces the bit rate of the encoded portions of the information, because, due to 
correlation, the information in two successive fiamea will not differ substantially. 

In the embodiments as ddSned in daim 13, the dlifference of the parameters oi 
two succesi^ive &ames is coded instead of the paramet^s itself. 

25 Prior solutions in audio codens thathavebeen suggested to reduce the bit »te 

of stereo program matexial inetude inteosity stereo and M/S stereo. 

Jxi the intensity stereo algorithm, hi^ :0cequencies (typically above 5 kHz) are 



dependent scale f^tots or intensity &ctors which allow to recover an decoded audio signal 
30 vdiich rambles the onghial stereo signal for these firequency regions. In the M/@ algoritiun, 
the signal is decomposed into a sum (or mi4 or common) signal and a difference (or side, or 
uncommon) signal. This decomposition is sometunes combmed with principle component 
analysis or time^vdrying scale factors. These signals are then coded independently, either by 
a transform coder or sub-band coder [which are both waveform coders]. The amount of 



D 
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infonnation xeduotioti achievod toy fliis algoritbm stamgjy depends on tbe spalial properties 
of tiiB source slgnaL For example, if iJ» aomce ^gnal is monaaral, the difference ^goal is 
zeto and can be cBscaided. Hovrevffl?^ if the conelatiott of the left and ti^t audio signals is 
low (vrfiidh Js often the case for the higher fiiequency regions), this scheme offecs only Utfle 
5 latraJe redaction. For the lower ftequencyregtomM/S coding ganeraUypr^^ 

merit 

Parametrio desmptlons of audio signals have gained interest during the last 
years, espedally in the field of audio coding. It has bean shovwx tiaat transmitting (quantized) 
parameters that describe audio signals requires ouly little transmis^on c^adty to re- 
10 ^yn&esize a perceptuaUy equal signal at the receiving end. 

audio coders fbcus on coding nionaiiral signals, and sCBEeo ^gnals are pd^^^ 

mono signals. 

These and other assets of the inveniion are apparent £tom and v^ 
ehKadatedvvhihr^erenceto the embo^Bments desedbedhet^nafi»r. 

15 

lathe dta'wiBgs: 

Pig. 1 shows a blodc diagram of a m\jlti-chaimel wacoder tot steieo audiOp 
Fig. 2 sliows a block diagram of a multi-chaimel decoder fbt stereo 
20 Hg. 3 shows a repiresentalio]! of tiie encoded data stream^ 

Fig- 4 shows an fflmbodimeat of the frequency tangep in accordance with the 

invaition. 

Fig. 5 shows another embodiment of the frequency ranges in acMrdance with 

theinyeaition, 

25 Fig, 6 shows the detenmnadon of the sets of paxameters based on paiiameters 

in a previous frame in accordance wi<h an embodiment of the invendoiLi 

Pig. 7 shows a set of paramet^Sf 

Pig. 8 shows the differential detemunaiion of &e parameters of the base layer> 

and 

30 Fig. 9 shows the differential determination of the paramet^s corresponding to 

a frequency re^on of an enhancement layer. 
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Fig« 1 sho^ a block diagram of a mulli-cIiaQriel encoder. The encoder 
receives a multL-chatmel audio signal wMdi is shown as a stereo signal RI, li and the 
encoder si^plies the encoded multi-channel audio ^gnal BBS. 

Tb^ down mixer 1 comhix^S the stereo signal or stereo channels RI, Lt into a 

5 sin^ diannel audio signal (also referred to as monaural signal) SC. For example, the down 
mincer 1 may detennine the average of the input audio sigxmLs lU, LI. 

The encoder 3 encodes the monaural signal SC to obtain an encoded monaural 
signal ^C. The encoder 3 may be of a known Idtad, for example, an MPEO coder (MPEG- 

jUB^ ^BG - LIII (mp3 ) 5 or M PB O a ^ AA C) - ~ 

10 XhCLparametMjdetfflcmining-circui^^ 

. . . characterklng.the iofonnation INF based on the input audio signals RI» LL Optionally, the 
parameter determining circuit 2 receives the masdmum allowable bit rate MBR to only 
determine the parameter sets Sl^ S2, . . . which when coded by the parameter coder 4| together 
with the encoded monaural signal BSC do not e^cceed tbe maximum allowable bit rate MBR, 

15 The encoded parameters are denoted by BIN. 



20 by way of example^, with respect to an eDobodimcnL Hie multi-channjel audio signal LX» RI is 
encoded in a smgle monaural signal SC (fimher also referred to as single channel audio 
signal). The parameten^tion of spatial attributes of the mnlti«dianml audio signals IJ, RI is 
peifi>rmed by the parameter defmnining circuit 2. The parameters contain information on 
how to restore the multi-chaimd audio signal Lit BI from the monaural signal SC. The 



25 patameters are usually encoded by the parameter encoder 4 before combining them wiih the 



combined coded signal is the encoded multi-channel audio signal BBS. The transmission or 
storage capacity necessary to transmit or store the encoded multL-diannel audio signal BBS is 
30 strongly reduced compared to audio cod^s that proc^ tiie xnulti-ohannels independently. 
NeverfhelesSs the original spatial impression is maintained by Ihe information INF -which 
contains the (sets oO parameters. 
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7 16.07^2 
In paitlculsr, fhe jparametric description of multi-ctiannel audio SI, II b 
related to a bissaoial p£bce33ins model wbidbi aims at describing the effective signal 
processing of the hiaaural auditory system. 

The model ^pUts the incot»ing audio LI, RI into s^^ 
S which, preferably, are spaced linearly at an HRB-mte scale. The bai^dwidth of these signals 
deprada oa the ceoter &e<iuency, following Ihe ERB-rate. Subsequently, preferably, for every 
firequecucy band, the fbllorwing properties of the incoming signals are analyzed,' 

_ The int^nral level di£&a:ence» or ILD» defined by the relative levels of the band-* 
Unidted signal stemming fiom the left and nght ears, 
10 itieint^ural time (or phase) differeiicer^ 

(or phase shift) corresponding to the peak in tbe intezannd ocoss-coirelation fonctlon, 
and 

- The (£s)shmlaxity of the waveforms that can not be accomited for by ITDs or ILDs, 
which can be param^teiized by the maximmn interanral cross-cottetation IC (for 
IS example, the value of the cross-conelation at the position of the maximum peak). 

the sets SI, S2, .« . of the tbree parameters, one set for each fiequency band 
ERl, FR2, . . vary over time- However, since the binam^ auditory system is very slug^sh 
in its processing, the iQpdate xate of these properties is rather low (typically tens of 
miUisecon(te). 

20 It may be as^qmed that the (slowly) time^varying paramet^ are the only 

spatial signal properties that the bhiaur al auditory system has available, and that fiiom these 
time and &equency d^endent parameters, the perceived auditory world is reconstructed by 
higher levels of the auditory system. 

Fi£» 2 shows a block diagram of a maltiHcdiannel decoder* The decoder 

25 receives ti3e encoded multi-d^annel audio signal BBS and supplies the recovered decoded 
xnulti'<^hannel audio signal which is shown as a stereo i^gnal RO, LO. 

The deformatter 6 retrieves the encoded monaural signal BSC* and the 
encoded parameters BIN' £romthe data stieamEBS. The decodecr 7 decodes the encoded 
monaural signal BSC into the output monaural signal SCO. The decoder 7 may be of any 

30 known kind (of oamse matched to tiie encoder that has been used), for example, the decoder 
7 is an MPEG decoder. The decoder 8 decodes 'ttie encoded parameters BIN* into outoof 
pai^met^s IMO. 



« 
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The demultipleiser 9 recoveis tbfi oii^ut stereo audio signals hO and RO by 
^plyiagfheparamctesrsetsSUSa, ... oflheouljMtparameierslNOonljieoutpmiaonaOTal 

signal SCO. 

Fig. 3 shows a jepresetrtatioa of ihe eacoded data atream. For example, in each 
ftasn© Fl. F2, the data package starts with aheadsc H foUowed by ihe coded mam&l 
signal ECS now indicated by A, a l9ist portion P I of the encoded in&nnation BIN, a second 
portion P2 of the encoded infbnnafion EIN, and a third portion P3 of the encoded infiaaanation 

BIN. 

Tf f^ft ftnn ^ PI. P2. .. . only comaases Hie header H and the c oded monaacal 



_I0 sf gnal ECS, only the monacEal slml SC is transmitted. 



15 



20 



75 



30 












As disclosed in BP-A-l 107232, the fhU frequency band in -vvhich the ii^nt 
etodio signal oecucs is divided mto a plurality of sab<-fiequency bands, whidi together cover 
«ua AtU 4T«a»enbv bantL In ths tetminoloav in accordance with the invention, the miilli- 



duomeli 



aub-ftequency band PHI, FR2, .... TMs ploraiily of parameter sets SI, 
first portion Pi of the encode in&rmaticMi BIM. Thus, to transnsit a basi 
channel audio signal, the bit stream oompnses the header H» the portion 
monaural signal BOS and the first portionPl. 



jiment of lihe invention, tha firat 



In fbe bit stream hi aocordaoce wxdi a 
portion PI ooi^sists of a single set parameters SI, only. The single set b^ng determined for 
ihe ML bandwdthPRl. This bit stream viM<d[i comprises the lisader H ^ 
P I provides a basic layer of quality, indicated by BL hi Fig. 3. 

To sispport an enhanced quality, fijrfhar portions P2, P3 of the coded 
information BIH sse present in the bit stream. These fiother portions ferm an enhancement 
layer EL. The bit stream may con^irise a single furdier portion P2 or more than 1 fiwther 
portion. The fioUier portion P2 preferably comprises a plurality of sets S2, S3,,., of 
parametws, one set for each sub-frequency band FR2, FR3. the sub- firequcocy bands 
PR2. FR3, ... preferably covering the fhll frequency band FRl. The enhanced quality may 
also be present in a st^?^se manner, a first enhancement level is provided by the 
eijhancemeat layer ELI which conapSses the first portion. And a second enhancement layer 
EL comprises li,e first enhancement lay er ELI and the second ^cement layer EL^ which 

coxcsxds^ the poriion P3- 

Tbfi fiirther portion P2 may also compdsQ a single s^t $2 of parameters 
oonespoiMiiog to a siogle ftequency h&ad PR2 vchich a sab-baad of the fliU fieqaenoy basad 
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FRl. The fiwttieE pordon P2 may fiOso compriso a number of sets of paarmeters S2j S3, ... 
wMob coffeapond to frequency bands FR3, ... wbicli together do oot cover the complete 

flail frequency band FRl. 

The further portion P3 preferably contaim pacaEueter 3els for frequency hmds 

§ which sub-divide at least one of the sub-bends of the furi^her portion P2. 

This f onnat of the bit stEeam in aoeordance vdiSx the iixvention allo^ at the 
teapsmission channel, or at the decodes to scale the quality of the decoded an^o sigpal wth 
the bit rate of the transmission channel, or the decoding complexity of the decoder. For 
example, if ^ decode should have a low power consumption^ as is important ia 

10 portable applications, the decode may have a low cois^lemtj^ and only uses the portions 
A and Fl. It would even be possible that ihe decoder is able to perform more comple^g 
operations t(t a higher power consumption if the user indicates that he desires a higher quality 

of the decoded audio. 

It is also possible that the encoder is aware of the maximum allowable bit s?ate 
15 MBR. whicb may be transmitted via the transmission diamiel or which may be stored on a 
storage medium. Now, the encode is able to decide on how many, if any» fbrdier portions PU 
P2, ... fit wiiMn the nmximum allowable Ut rate ^ 
allowable portions Pi, P2^ ... in the fait stream. 

Fig, 4 shows an embodimexd: of the frequency ranges in accordance with the 
20 inv^ention. this embodiment, the frequency band FR^ 

of the multi-channel andio signal II, RI, and liie frequency band FR2 is a sub-frequency band 
of the full bandwidth PBW. 

If these are the only frequency ranges fbr which parameter sets S 1 , S2, , . . are 
determined, a single parameter set SI is determined for the frequency band FRl and is 
25 piesent in the portion PI , and a single parameter set S2 is detennined for tihie frequency band 
PR2 and is pies€Srt in liie portion P2. The quality scaling is possible by either ushig or not 

using the portion P2. 

Fig. 5 shows another Mabodiment of the frequency ranges in accordance with 

l3ie n$ventioxi« In this embodiment^ the frequency band FHI is again equal to the full 
30 bandwidth FBW, and the sub-frequency bands FR2 and FR3 together cover the frill 

bandwidth FBW. Or said in other words, the frequency band FRl is subdivided into the sub- 
frequency bands FR2 and FR3. 

If these are the only frequeocy ranges fr)r which parameter sete SI, S2. . . . are 
detemunedf the portion PI comprises a single parameter set SI determined fr)r de frequency 
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band FRl, a»d the portiatt P2 comprises tvwo paiaaieter seta S2 and S3 detemrined fia ih© 
frequefloy band PR2 and PRS, jespectively. The quality scaling ia posable by either uang or 

not losing the poition P2. 

Fig. 6 shows the deteraiioation of the sets of paiameters based on parameters 

in a tnreviovis fiame in accordance -with, an embodiment of "die iavaotion. 

Fig. 6 shows a data stream which compiisos in each frame Pl^, T2» , , . the 
coded information BIN "vi^ch comprises ihe portion PI which is part of the base layer BL 
and the portion P2 whidi &maa the euhaocement layer EL. 

LE2,Jby3Bay of.Bxampleyixamptises- 



, 1 




y 




mi 



tcm sets of parameters 32, S3, S4» S5 wMcb are ddtemimed for th.e sub-&eqaency bands 
FR2> FR4, PR5, respectively. The four sub-feequeacy battds FR3, FRA, FRS sub- 
divide the fiequeiu^ band VKl . 

In the fcdxne F2 whioh succeeds the fiame F U the p onion P 1 con^prised a 
15 singld set of parameters S V which are detemnned for Ihe ML bandwidth PRl and are part of 
the base layer BL' . The portion P2 comprises fbur sets of parameters S2% S3 % S4% S5' which 
are again delmninedfor the sub-fteqaency bands FR2, PR3, FR4, FR5, Tospectively and 
which fom the eohaneemeat layer BL'. 

It is possible to code each of the sets of parameters Si, S2^ fbr each caie of 
20 the feumes F 1, F2s , . . separately- It is also possible (o code the sets of parameters of the 
portion P2 with respect to the parameters of the portion P 1 . This is indicate by the arrows 
starting at SI and ending at S2 to SS ia the fi»me FL Of con^^ 
other ftames F2, - . . (not shown), hi the same maimer, it is possible to code the set of 
parameters SI' with respect to SI . And finally, the sets of parameters S2', S3', S4', S5' may 
25 be coded with respect to the sete of parameters S2, S3, S4, S5, 

hx tWs manner, the bitxate of the encoded information EIN can be reduced as 
the reduru^cy or correlation between sets of paiameters Si is used. 

Preferably* the newparametca:s of the new sets of parametei^ 51% S2% S3', 
S4% S5 ' are coded as the difference of fhek valne and the value of the parameters of the 
30 previous sets of parameters Sl> S2. S3, S4, S5. 

At regular time intervals, at teat the paramet^ set SI has to be coded 
absolutely and not differential to prevent errors to propagate too long. 

Fig. 7 shows a set of parameters. Each set of parameters Si may comprise one 
or more parameters. Usually the parameters are localization cues which provide information 



15 



25 
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about the localizoHcn of sound objects ia tbe audio ia&nnation. UsuaUy the looalizadon <mes 
ase lie interaiiial diflfet^ 

and the ioteEaural aoss-ooscdatioft IC. Moie detailed infiMmalion on tJjese patanofiiers is 
provided in the Audio EngLneering Society Conveasioa P^er 5574 "Binaural Cue Coding 
5 Applied to Stereo and Mvdti-ciiamel Audio Compression" pceseatBd at tbe 1 12*^ Conveation 
2002 May 10-13 Munich, Genaaay, by Chnstof Falley et al. 

Fig. g shows Si© dififera>lial detenninaliofl of apaiameter of the base layer. 
The horizontal axis indUsates soccesave Sfaaaes F I to F5. The vertical axis sho^w the value 
PVO of a paf ameter of the set of parametees SI of the base layer BL. This paiasneter has the 
10 values Al to AS for the fiaraes Fl to F5 respectively. The contribution of this parameter to 
the bit rate of ihe coded in&Hnation UN wll decrease if not the actual values A2 to A5 of 
the parameter ai» coded but the smaller difBaiences Dl, D2 

ilial detenmnatiow of the parainflfcerfi corresponding to 



Mm- 



Breqai 



fiames Fl and F2, The vertical axis in<Kcatss the values of a pajtlcular paraaieter of 1h© baa© 
layer BL and the enhancement layer EL, In this eseample, the base layer BL comprises the 
portion PI of in&madioa INF -with a singje set of parametess dfitennined &r the full 
ftequsncy range FBW, the particular parameter of the portioa Pl has the value Al for the 
ftame Fl and A2 for Ihe frame F2. The enhanoemmt layer EL comprises the portion P2 of 
20 1nfi>"^p»t"" INF -with three sets of parameters determined for three respeotiv© frequency 

ranges FR2. FK3, FR4 vMs^ti together fill the fUU frequency range FBW- The three partioular 
peramebss (for example, the parameter representbig the JLD) have a value Bl I, B12, B13 in 
the frame Fl and a value B21 . B22, B23 in the frame F2. 

The contribution of these pasameters to the lat sale of Ae coded infosmation 
Em wiU deosease if not ft© actual vahies BU to B23 of the particular parameter a» 
but&e diJ^imcas Dll. D12,.... because these differences can be encoded more efScieally 

than the actual values. 

To amnmasize, in a preferred embodhnent in accordance wilii the invention, it 

is proposed to organisaa the stereo parameter hif onnadon INP aueh Ihflt a base layer BL 
30 (xiiiiains one set ofparamsters (preferably the time/level di^Ba^^ 

v*lch is detemiined for the foil bandwidth FBW of ihe multl-dhaanal audio signal LI. RI. 
The enhancement layer BL contains mqltiple seta of parameters S2, S3, . . . vdndi conespond 
to subsequent feeqpjenoy mtervals FB2, FRS, . . . wifeia the foU bandwidth FBW. For bit-rate 
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The infounatioa INF is encoded in a multi^ayered manner to ^ble a soalii^ 

of the decoding quality veisus bit rate. 
5 To conclude, in the tiow foUoMwng, an preftoed embodiment in accoxdaoce 

-wlHi the invention is ducidated wiOi respect to program code and its eluddalion. 

Piist, for all 3Tibfi»mes (the portions PI, P2» ...) inliie frames Fl, F2> ... the 
data BSC fbr monaoxal represmiiatioa SC, the data BIN fbs the set of stereo parameters SI 

IQ regions;^ FR2. FR3. ... is determined. 

Hie pro-am code is ^wn at the left hand iside, and an elaoidation of the 
program code is provided under description at the xigbt hand dde. 

code desgtipfticn 



15 { 



{ 



fbr (f « 0; f < xaoijasoesi fH-) 



for all fiames do: 



{ 

e3£an^ejcnonoj6came(f) 




20 



25 



example j5tMeo_eJCtension_layer_lC0 



exaniple_sterB0j»ct«asioa__layer_2(Q 



get data stereo parameters 
Ml bandmdth (tiiB 
portion Pi) 

get data stereo parameteis 
frequency bms (the 
portion P2) 



} 



30 } 
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code 

exampl&„stBreojBxlfflmi(mJayw„l (0 
{ 

xe&^jsteceo 



desoiiptiotL 



Itit tl ttnfttiT\g -whethfif or not data is to to 
absolutely coded or not 



10 



15 



20 



if (r^teah,ateteo = 1) 
{ 



} 

{ 
> 



if data is to be coded absolutely 

code the aotud inlBiauxal iisten^ 
difEeteoyoeCild) £>r the wbole fiequenoy ate» 
(global) 

ifnotaxe&asb. 



ildjglobal_di£^f} code ildwtb inspect to the pcevioTOftame 



} 

Thirdly, dependiiig oathfi -vaUi© of the lHt»aae8h_steceo the steieo parametets 
for aU of the fiequency bins are coded absolutely (the actual value is coded) or 13ift distance 
with the conesponding parameters for Hie fiiU bandtwidth is coded. The fijllowiug code is 
valid for tiie intetaural level difference ILD. 



■ J* 



ifre&esh 



code 

25 ecaaiplfi_steaceo_e)cteii^oftj^e^_2(0 
< 

ifiCiefeesh_stereo=^l) 
{ 

forO>«Oi b<mof JjioK b++> fbr all ftequenoy bins 
30 { 

ildj^^b] code the ild in that bia relative to ^ 

global value 



} 



} 
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else 



if no re&eah 



{ 

{ 





The tecca ^^efre^h^stereo" is a flag dcoooting whetbior or not the stereo 
parameters should be re&^^ed (0 ^ FALS6» 1 » TRUE). 

The teem ^d^global^^f]'* lepresents the HufEinan encoded absolute 
repi^entation level of the ILD for the whole £cequeiiGy area, for frame £ 

The teem "ild^lobal_dlfiE[q" represents the Huffinan encoded relative 
representation level of the ILD for the whole fcequenoy area for fiaxne f. 

The term "ildjbm[f; b]" repiesents the Huffinan encoded absolute 
representation levd of the ItD for frame f and bin b* 

The term *nidJ^inj3if3E[f, b]" representp the Huffinan encoded relative 
representation level of lihe ILD for frame f and bin b« 

It should be noted that the above-mentioned embodiments illustrate rather than 
limi t fhe invention, and that those skilled in the art will be able to design many alternative 
embodiments without departing &om the scope of the appended claims. 



the CKtenMon to a more than two channel audio signal can easily be accomplished by the 
skilled person, 



oonstraed as limiting tiie claim. The word "comprialng" does not exclude the presence of 
elements or steps oth» than those listed in a <daim. The invention can be implemented by 
means of hardv^e conqnising several disthict elements, and by means of a suitably 



meajw can be embodied by one and the same item of hardware. Themete feet that rartedn 



Althoi^ the invention is eluoidated in Ihe Pigs, with respect to a stereo signal. 



In the claims, any reference sigc^ placed between parentheses shall not be 
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measures are recited in xnutually difi^rent dependesit oiaims does not iDdicate iSxst a 
combination of tbiese measures camxot be used to advantage. 

In summary^ multi-ohamiel audio ^go^s are coded into a monaural audio 
sig»al and infonnatlon allcwing to recover tiie multi-cbaimel audio signal from the monaural 

S audio ^gnal and the information. The information id geoerated by determisung a first portion 
of the information for a frequency le^on of the muM-ohannel audio aignaU and by 
detecmining a second portion of tiie information for a second frequency region of Hie multL*- 
channel audio signal* The second frequency rej^on is a portio^^ of the first frequency region 
and thus is a ffub-rang^ of the firat frequency region. The Infcmnation is multl>*layered 

10 enabling a scaling of the decoding quality versus bit tate. 
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CLAIMS: 



I ^ A method of encodmg a muM-ohaimel audio agnal oon^rismg at least two 

audio ohaimeils. the method conapnsmg, 

gooerating a angle dhannel auciio signal and encoding the single clieinndi atidio 
signal iiito a bit stx^on as an encoded single channel audio dgnal» 
■5 geneca&)g infoimatton irom tbe at lei^t two audio cisaoneis auowing 10 

audio idgnal and ibd infi>Emfldcni» tbe gqoexating of the in&imatioxi compiising» 

detenmning a fiist pordcm of ^ Infonnotion for a fbrst fceqnency region of the 
mnltirduumel audio signal* and eaoooding the fast portion of the infonoaation into the bit 
10 stream as an aocoded fiist pmlioa of the infonnotioii, and 

detenninii^ a second portion of the inferaiadon for a second fieq^tenoy rei^ 
of ibe znulti-Gbaimel andio aigDal, the second ficequenoy xe^n being a porljon of the first 
ftequency re^on» and aicoding tbe second pordon of Ibe infonnation into tiie bit stream as 
an encoded second portion of 'die infonnadooi 

15 

2. A method of enoodwi^ a multi-channel audio signal comjMnwng at least two 

audio channels, the method compri^ng» 

generating a ^ngle chattnal audio ^gnal, 

genra^atinginfozmation&om the at least two audio channels allowing to 
20 recover -wilb a required qualily level tiae muM-cbannel audio signal fiom ibe single channfil 
audio ^gnal and tbe in&imation, the generating of the information comprising, 

receiNing a maximum allowable bit rate of the encoded mnlti-ohaim^ audio 

sigi^and 

only determining a &st portion of the in&nnation for a first fieqoen^ reg^ 
25 of tbe mnlti-channel audio signal if a bit rate of tbe encoded multl-ohannel audio signal 
comprising the single channel audio signal and the first portion of the information is not 
bi^ier than ti:^ maximma allowable bit rate. 
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3. A method of encoding as oUimed in clmm 1 or 2^ wherein the single channel 

audio sigoal is a particular combinadon of the at least two audio channels. 



4, A in«thod of eiicoding as clainied in dm 

S infoxmafion comprises sete of pajraxneters, the first portion oompiises at least a first one of iho 
sets of paiameteKSj the second portion oonqoiises at least a second one of the sets of 
paramet^s^ wherein each set of parameters is Msociated with a corresponding frequency 
region. 

10 5. A method of encoding as claimed in claim 4, dbtas^acterized in that the sets of 

parameters comprise at least one localization cue. 



6» A medhod of encoding as claimed in claim 5, characterized in lhat the at least 

one localization caue is sdected ftom: an intraoral level difference, an interanral time or 
13 phase difference, or an Interaural cross-correladon. 



7. A method of eiicodixig as claimed in claim 1 or 2, characterized in that the furst 

frequency region covers a fiill bandwidth of the multi^annel audio signaL 



20 



25 



8. A method of encoding as claimed in claim 1 , characterized in that the first 

fireqnency region substantially covers a fUU bandwidth of the multi-channel audio signal, the 
second fiequency re^on covers a portion of the full bandwidth, and ini that the detetmining of 
the second portion of the mformat^on is adapted to determine sets of parameters for both the 
second frequency region and a set of fbrfb^ fiequency regions, the second fi^quency region 
and the set of fivlher fiequency regions substantially covering the full bandwidth, wli«e in 
the set of further fieqaency re^ons comprises at least one further frequency region. 



9. Amcthod of encoding as claimed in claim S, characterized In that the single 

channel audio signal and the first portion of the information form a base layer of information 
30 whidh is always present in the encoded multi-channel audio signal^ and in that the nciethod 
comprises receiving a masdmum allowable bit i^te of the encoded multi-channel audio signal, 
the second portion of &e information forming an enhancement layer of infonnation which is 
encoded only if the bit rate of the encoded base layer and enhancement layer is not higher 
than the maximinn allowable bit rate. 



PHKIj020692EPP 



18 



• 



024 16.07.2002 16 



16.07.2002 



10, A method of ©noodJng as claimed in claim 4. 

4ete»niiimg of Ihe first portioa of infonnalioa in apardculai: ftame of encoded infiaanatioa 
comprises deteimiciog ±s> first one of the sets of parameters in the particular fiame. and 
coding liie ftcst one of ihe sets of parameters based on flie first one of the sets of parameters 
of a fiame preceding the paidculai: &ame. 

11. A method of enoodiixg as claimed in claim 8, characteri2!ed in 

j^^^t^Q /.f i v o p^i^ftT. nf iTifarniation in a paKlomaf J:mg Of 

if ifhrmfiti^ romFiff*^" H«^fwtt^'^*«Fi ^« <^ o f parameters of the seoon d portion in Ifae 



partioqlar ftame and coding the seta of parameters of the seoond portion in the particular 
ftame based on the sets of parameters of a 6ame preceding the particular frame. 



20 



25 



30 



12. 



A mefliod of encoding as claimed in claim 8, ctocaoterized in that the 



IS detemuning of the seoond portion of 



ia a paiticolax fiam& of fbe eiicod^ 



tofonnation comprises detenntoing the sttB of pa^amflters of the seoond portion in Ito^ 
particular frame and coding the sets of parameters of ifae second portion in the particulw 



frame based on the first one of the sets of paran 
firame* 



sfi of a ftame preceding the paitioolar 



AmsthodofencodingasclaimedinanyoneoifliicclaiiaalOtfii 12, 
characterized inliiat the determining comprises calculating a difference between the 
corresponding parameters in the particular ftame and the frame preceding the particular 

firame. 

14. An encode* for coding a multi-channel audio signal comprising at least two 

audio channels, fbe encoder comprising! 

means £» generating a single channel audio signal, 

means fbr generating information from the at least two audio channels 
allowing to recover -with a required quality level the muM-ohannsl audio agnal ftomihe 

»n, ^ generating of the hifoiroaliQn 



co^ 



ituia 



Lsmg, 



means fbr determining a first portion of iiie information for a first fi^quenoy 
region of Hie multi-cbamisl audio si^oal, vd 
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5 



15. 




audio ohannels, the encoder compiisiiig, 

meaiis for generating a single ohaimel audio signal, 

rneau^ fox generating information from tbe at least two audio channels 

aUovdng to recover vvith a required qiudity level the mulliK^hamal audio signal from the 
10 single channel audio signal and ihe infoimation. the generating of ihe information 

comprising, 

means for receiving a maximum allowable bit rate of the encoded multi'* 

chaimel audio signal, and. 

means for only determining a fir^t portion of tiie information &r a first 
15 frequency region of the multi*<chaimel audio signal if a bit rate of the encoded muUi-cliaxmel 
audio sigiial comprising the single channel audio signal and the ifirst portion of the 
information is not higher than, the maximum allowable bit rate« 

■ 

16. An apparatus for supplying an audio dgnal» the apparatus comprising: 

20 an input for receiving an audio signal, 

an encoder as daimed in claim 14 or 1 S for encoding the audio signal to obtain 
an encoded audio signal^ and 

an oulput for sirpplying the encoded audio signal. 

25 17. An encoded audio signal comprising: 

a single channel audio signal;, 

information from the at least two audio channels allowing to recover with a 
required quality level the multi-channel audio signal from the single channel audio signal and 
tile information^ the information comprising* 
30 a first portion of the information for a first frequency region of ihe multi-* 

channel audio signal, and 

a second portion of tiie information for a second frequency re^on of tiie multi- 
channel audio signal^ the second frequency region being a portion of the first frieqnency 
regioxL 
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18. A stor^e medium an vAMi liie owoded aadio signal as olaii 

has been stored. 



A * 



in claim 17 



19, A meHiod of decoding a multi-dhannel aua© signal bdng encoded as ciaimfid 

in daim 17, the metiiod of djecoding compriang: 

obtaining a decoded sdn^e diisimel audio signal, 

obtaining deooded infoncaalion fiom tJie infoimation allowing to tecover ^ 
«« ^ ^ ripn,.! feB <1a.^odprt sinftlft channH m<^io pipnal aff| the dpcpded 
jnfft»^,tTVj dftfloded infotmadon oo ntorises the fltst por tion of &e infonnatioa and the 



second poirtion of the inlbnnation, and 

flying eatber the as* porticm offhfi inf^^ 
second portion of Ihe infbrmatloa on tt» single chanuel audio slgM to generate &e decoded 

nudti-chaimel aiu^o signal. 



20. 



A decoder fox decoding an encoded audio signal, the decoder oorapiistog: 
means fox obtaining a decoded sii^ channel audio aignal» 
means for obtaining decoded infijnnation ftom the infiwmatioa allowing to 
recover the mvilti-channel audio signal fiom li» decoded single cihannal audio signal and the 
decoded informatiott. Hie decoded infiamation comprises liie first portion of the infimnation 

and the second portion of the information, and 

means for applying tiie first pottian of the information and the second portion 
of the infiinnallon on the single channel audio signal to generate the decoded mnlti-dhannel 
andio signal 

21 . An apparatus for supplying a decoded audio signal, the apparatus comprisingt 

an input for receiving an encoded audio signal, 

a decoder as claimed in dahn 20 fbr decoding the encoded audio signal to 

obtain a nmtli-dunnd oidput si^^nal. and 

an outpirt for siqjplying or reproducing the multi-channel output sigflaL 
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ABSTRACT: 




informalion for a first frequflocy region of the miiii-clmmfil audio eignal^ OTid by detemiiniag 
5 a second portion of ihe infotmation for a second ftequency region of the mnlti-cbannel audio 
digoaL The second fi;equenoy re^on is a portion of the first firequency region and thus is a 
sub-range of ite first frequency legion. Ibe infoxmation is multi-layered enabling a scaling of 
the decoding quality versus bit rate. 

10 (Kg. <?) 
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